Jiazhao Zhang

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

May 28, 2026

Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting

Apr 14, 2026

NavGSim: High-Fidelity Gaussian Splatting Simulator for Large-Scale Navigation

Mar 16, 2026

SPAN-Nav: Generalized Spatial Awareness for Versatile Vision-Language Navigation

Mar 10, 2026

LDA-1B: Scaling Latent Dynamics Action Model via Universal Embodied Data Ingestion

Feb 12, 2026

ReWorld: Multi-Dimensional Reward Modeling for Embodied World Models

Jan 18, 2026

TrackVLA++: Unleashing Reasoning and Memory Capabilities in VLA Models for Embodied Visual Tracking

Oct 08, 2025

RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction

Jul 23, 2025

BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion

Jun 18, 2025

OctoNav: Towards Generalist Embodied Navigation

Jun 11, 2025